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'ABSTRACT 

Determination cf the fairr.ess of preschool tests for 
"use with children of varying cultural backgrounds is the major 
obiective of this study. The predictive validity of a battery of 
preschool tests, chosen to represent the core' areas of preschool 
assessment, across race and sex, was evaluated. Validity of the 
battery was examined over a 12-month period, involving 322 
preschoolers (white and black, female and m^le) . A regression 
equation was "determined usina all preschool measures to predict 
achievement scores. Prediction? were made for each individual test, 
and residual terms were calculated. No significant differences 
occurred in mean residuals between any pair of groups, indicating an 
absence of bias in prediction across race and sex with the large 
battery. Sex bias in prediction was seen wher. subsets of the larger 
. battery were examined, it is suggested that. test developers become 
aware of the issue of bias and demonstrate differential construct and 
predictive validity as a part cf the development of a test. 
(Author/GSK) 
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Abstract 

♦ • 

The study evaluated the predictive validity of a battery 
of preschool tests, chosen to represent the core areas of pre- 
school assessment, across race and sex. Predictive validity 
of the battery was examined oyer a 12 month period utilizing 
322 preschoolers (90 white females, 86 white males, 73 black 
females, 73 black males). The preschool battery was administered 
at the end of kindergarten and the Mettopolitan Achievement Test 
at the end of first grade. A regression equation w^as determined 
using all preschool measures to predict achievement scores. Pre- 
dictions were made for each individual test, and residual terms 
were calculated. Residuals were submitted to a race by sex ANOVA 
for each subtest of the Metropolitan Achievement Test. No sig- 
nificant differences occurred in mean residuals between any pair 
of groups, indicating an absence of bias in prediction across 
race and sex with the large battery. When subsets of the larger 
battery were examined, sex bias in prediction was seen, indicating 
significant underprediction of female performance in some achieve- ' 
ment areas. However, the magnitude of the effect was small. 
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The past decade has wl.tnessed a substantial surges of In-^ 
t^prest In preschool assessment, and in the differentia^^ valid- 
ity of psychological tests when used with individuals of varying 
cultural backgrounds. The use of tests normed primarily vOn 
White children la of special concern to educators due to the 

specifications of P. }s. 94-142 (Education for* All Handicapp;ed 

/ 

Children Act of 1975) and the now famous Larry P. case in San 
F^ancisco^. Harrington (1975) *has gone so far as to state that 
it is not possible for tests developed and normed on a white 
majority to be other than biased against minorities, or to pre- 
dict accurately when used with minorities. 

» 

In reporting on the use of educational and psychological, 
tests with disadvantaged students, a conmittee appointed by > 
the American Psychological Association, Board of Scientific 
Affairs (Cleary, Humphreys, Kendrlcl^& Vesman, 1975) offered a 
definition. of test bias. While including content and con- 
struct validity as important variables in the Issue of bias, 
the emphasis was clearly on predictive validity: "A test is 
considered fair for a particular use if the inference drawn from 

I 

the test score is made with th^ smallest feasible randcrtn error 
and if t:here is no constant error Iq the inference as a function 
of membershi^p in a particular group" (p. 25). The definition 
of bias offered by the APA committee is a restatement of previous 
definitions hy Cardall and Coffman (1964), Cleary (1968), Pott- 
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Hoff (1966), and others, and has been widely accepted (though 
certainly not without criticism, e,g., Bemal, 1975; Thomdike, 
lj971). 

Oakland and Matuszek (1977) examined class placement pro- 
cedures under several proposed models of bias and demonstrated 
that application of the Cleary model results in the smallest 
number of children being misplaced, though under certain le^is-r 
lative conditions they favored^ the Thomdike selection model. 
Fot the present study, the Cleary etal. definition is stated in 
somewhat^ different (though reasonably equivalent) terms : A ■ 
battery of tests is considered fair if it signif idantly in- ' 
creases one's ability to predict the 9rlterion variable (i.e., 
R > 0) and ii errors\in prediction are independent of .group 
membership (i.e., Xej ^ « Xej^ - . . . gej^ ) . 

Much research is available on the validity of the ^AT for 
blacks and whites. A few. 8tudiei{ dealing with the predictive 
validity of IQ tests with elementary and secondary school child- 
j:en have recently appeared (e,g, , Reynolds & Hartlage, 1979). 
However, only a single study has been located dealing with the 
differential validity of preschool tests. Mitchell (1967) 
studied the, validity of two broad based readiness tests ixied to 
predict first grade achievement for blacks and whites, and found 
similar validity coefficients for the two races. However, Mit- 

% 

chell*s study was limited to comparing the magnitude of the 
independent- dependent variable relationship and did not look at 
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error terms when a common equation is applied to all groups. 

Since mos.t major preschool assessment devices (e.g.\ 
Wechsler Preschool and Primary Scale of Intelligence, McCarthy 
Scales of Children's Abilities)* require highly trained person- 
nfel and moie than one hotir of testing time per child, they arl 
typically not employed in large scale screening programs. Con- 
sequently, a vari**ty of short screening^ test/ and several group 
readiness tests have be-sn developed and normed for the preschooler. 
Available instruments include tests of basic concep.ts, criterion 
referenced "Achievement" tests, tests of visual-motor integration 
and nonverbal concept formation, and general readiness screening 
batteries (Reyno.lds, 1979,). The present investigation includes 
test6 representative of each of these skill areas. These tests 
atre frequently used4n placement decisions (e.g., regular first 
gra^e versus a developmental classroom) , deciding upon ref brrals 
for more comprehensive psychological evaluation, and in decisions 
6n early admissions to kindergarten or first grade. The study 

\ 

is designed to determine the fairness of preschool tests for use 
with children of- varying cultural backgrounds. More specifically, 
the following question is asked: When preschool tests representa- 
tlve of the core areas of preschool assessment are grouped into a 
battery used to predict first grade achievement, are the errors 
inherent, in statistical prediction (tesidual terms) randomly 
distributed by race and sex when a common multiple regression 
equation is ap^Slled to all scores? 
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METHOD 

Subjects 

The initial' sample for pretesting consisted of 393 kinder- 
garten children from a school district located in a small metro- 
politan area in the southeaste^ United States. Complete data 
vere obtained on 322 of the 393^ children, representing an at- 
trition rate of 18% over the one year period. The final sample 
contained 90 white females (WF) , 86 vhite males (WM) , 73 blaok ^ 
females (BF). arid 73 black males (BM). The total group had a . 
.mean age of 82.57 months at the time of post-testing (end of 
first grade), with no race or sex differences by age. The sample 
was primarily middle to lower-middle class in SES ratings .y^ with 
the white SES rating significantly higher than the blacks * (t - 
29.5. df- 320. £ < .001). The children are described in greater 
detail in Table 1 and elsewhere (Reynolds. 1978a). 



, Insert Table 1 about here 

» 
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« 

Test Instruments 

Pretesting was conducted in May of the kindergarten^^ear on 
each of the following tests, with the exception of the Metro- 
polltan Readiness Test which was administered during the second . 
week of first grade. Pretests Included: 

I) The KcCar thy Drawing Tests: Draw-A -Design (DAD) and 
Draw-A-Chlld (DAC). DAD and DAC are subtests 12 and 13 of the 



« 
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'McCarthy Scales of Children's AbilitiBS (McCarthy, 1972). The 
two te^ts measure primarily visual -motor integration and non- 
verbal concept formation. Although designed to be ^individually 
administered, Reynolds ^1978b) found that the two tests- can be / 
validly administered in a group setting with the obtained scores 
showing significant correlation (typically around .50) 'with 
ifieasures of achievement. Reynolds also obtained test-retest 
reliabilities frori group to individual administration for DAD and 

DAC of .86 and .82, .respectively. In the present study, the two 

* i 

tests were administered in groups of 8 to 10 by k kindergarten 
teacher and aide trained i|^ the administration by the investi- 
gator. Scoring was -done by a doctoral Level school psychologist 
with no knowledge of the child's race or age. / 

2) Lee-Clark Reading Readiness Test (Lee & Clark, 1962) : 
The Lee-Clark is a group administered reading readiness test which 
requires the matching and differentiation of letters, basic con- 
cept' recognition, and letter and word identification. The Manual 
reports split-half reliabilities, corrected by Sp-iarman-Brown, 

^ of \96 for two separate samples of kindergarten pupils (N*s - 94 
anh 80). Validity coefficients hovering about .50 with several 
first grade reading tests are also reported in the Manual . 

3) Tests of Basic Experiences (TOBE; Moss, 1970): The TOBE 
are a series of five standardized group tests for young children. 
For the present study, two of the tests are administered: Mathe- 
matics and Language. The TOBE Mathematics subtest, according 
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to the author, measures the child *6 mastery of basic math con- " 
cepts and the terms associated vith them (e.g., biggest, "^Ideist, 
most, etc.). The TOBE Language .test primarily deals with vo- 
cabulary, sound-symbol relationships, listening skills, and ^ 
letter recognition. The TOBE Manual reports KR2Q reliability 
estimates of .80 for Mathematics and .8A for Language with a 
sample of 700 kindergarten children. Only evidence of content 
validity is given in the TOBE Manual, which is based on teacher 
judgments as to the correct classification of each item (i.e., 
Math, Language, Science, etc.). A high percentage of agreement 
was obtained between . teachers ' classifications of the items and 
the items' actual classification. 

A) Preschool Inventory - Revised Edition (Educational 
Testing Service, I/O): The Preschool- Inventory is divided into 
four subtests measuring Personal, -Social Responsiveness, Associ- 
ative Vcrcabulary, Concept Activation-Numerical, and Concept 
Activation- Sensory. The Inventory is individually administered 
and' consists of a series* of general information questions and 
simple Activities such as design copying. The l^ventojy. was 
carefully normed according to U.S. Census data. KR^q reliability 
estimates ranging from .86 to .92 are reported for the standard- 
ization sample at various age ranges with a total sample KR2Q of 
.91. Correlations with the Stanford-Binet Intelligence Scale are 
reported in the Manual and range from .39 to .65 with a total 
sample correlation of .44. The sample consists of 1,476 children 
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divided into five age groups between 3-0 and 6-11 ^or norming 
and other statistical treatments » For the present study, the 
inventory was administered individually to each student and 
scorer] by a State of Georgia Department of Education certified 
school psychologist. 

5) lietropolitan Readiness Tests (Hildreth, Griffiths, & 
McGauvran, 1969); The MRT were devised as a group test to measure 
the various skills in young children yhich contribute to readiness 
for first grade, and according to the authors, provfde a "quick, 
convenient, and dependable basis for the early classification of 
pupils . ." Six basic subtests and one alternate,' an adapta- 
tion of the Goodenough-Harris Drawing Test, are provided measuring 
vdcal>ulary, visual-motor integration, verbal compjehension, let- 
ter identification, and a variety of other' subskills (e.g., visual 
perception and diiscrimination) . Split-half reliabilities are' re- 
ported in the MRT Manual and range from .90 to .95 for the total 
test with seven different samples. Subtests* reliabilities are ^ 
considerably lower ranging from .33 to .89 across the 
Alternate form reliabilities 'range from .89 to .9,3 for total test 
and .50 to .86 for individual subtests'. Numerous validity studies 
are reported in the Manual and show correlations with later a~ 
chievemexit measures consistently hovering about .50. Due to the 
results of several factor analytic studies (Reynolds, 1979; 
in press), only the MRT total test score was used in the present 
study. 

• ' 10 
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The criterion measure chosen was the Metropolitan Achieve- 
ment Tests, Primary I uattery OUT). The MAT was chosen for 
several reasons. Within the school system in which data were 
"collected, the MAT had been chosen by the first grade teachers 
from a selection of standardized first grade achievement tests 
for five consecutive years as the test which most closely mea- 
sured what they taught. While this consideration is of primary 
importance, Anastasi (1976) also singles out the MAT as an ex- 
emplary model" of an achievement test designed to eliminate or 
reduce racial bias in item content. The MAT is a group admini- 
■ steried test yiel4in| subscores^for Word Knowledge, Word Dis- 
crimination, Reading, and Arithmetic. Split-half reliabilities 
for the subtests range from .81 to .95 with a median Xetiability 
of .91. . • ^ 

Procedure • ' 

During the last- two weeks of May, 393 of approximately 400 
kindergarten children we're admin^^ered the two McCarthy Drawii^g ' 
Tests, the two TOBE subtests, the Lee-Clark Reading Readiness 
Test, and the Preschool Inventory. The following fall during the 
first two weeks of school, all first grade children were adroini- ' * 

stered the l^T. In May of the first grade year, all students 
were administered the MAT. ^ Any student with missing data for any 
test was eliminated. from all analyses, leaving 322 children'. 
Data Analysis 

— r- — T— ' . ^ 

Since the age range of the sample was nearly two years, all 
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raw scores were convei^ted' to standard scores based on data avail- 
able in their respective manuals. Due to a lack of information 
concerning the raw score distributions of the MRT and the MAT, 
scores from these two tests were converted to percentile ranks . 
A* multiple regression equation was computed employing all inde- 
pendent variables in the prediction of MAT scores for the total 

♦ t • 
w 

sample. An equation was calculated for each MAT subtest. In- 
cluding all independent variables in the multiple regression for- 
mula, instead of u^ng a stepwise procedure when examining for 
bias, increases reliability (Linn & Werts, 1971)^ and lessens • 
th^ probability of finding artifactual differences. Using the 
equations for N-322, predicted MAT scores were obtained for. each 
subtest. Residuals were then calculated. To determine whether 
reiSiduals were randomly assigned by race and sex^a 2x2 ANOVA 
(race by sex) was calculated on the resldtial terms, producing ^ 
four separate analyses (one for each MAT subtest), the liult 



hypothesis in^ach case being: Hq: v. - v " v., - .ry . 

*BM ^WM ^BF ^SvF 



Use of the ANOVA apjjroairh to this question allows for examination 
of. the race by sex interaction in the prediction of first grade 
achievement from preschool screening test scores, and is a 
straight- f-orward test of the Cleary et al. (1975) condition that 
"there is no constant error in the inference as a function of 
membership in a .particulat group." A signficant F - ratio in 

* i » 

this case indicates nonrandom error and thus bias in prediction. 
Since four separate AMOVAs were calculated, a significance level 
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E 1 '01 was adppted. 
As an internal replication of the study, two of the pre- 

« 

tests were chosen (TOBE -Language and the Lee-Clark) to be 
examined in combination without including the other pretests. 
Regte^sion equations for each dependent variable with the 
TOBE-Language and Lee-Clark as predictors were calculated for 
N - 322 and the analysis of residuals repeated as described , 
above. $ince most school districts are likely to apply only 
one or two screening tests (instead of the large battery of the 
present study) , this analysis allows examination of the more 
practical implications of the present study. One' further 
analysis was conducted to also aid the generalizability of 
results across tests and to practical screening settings. 

The MRT is" certainly one of the most widely employed 
screening tests in the public schools (Reynolds, 1979; in press), 
frequently being the only screening measure employed. In the 
current stiid'y, the MRT ^howed the largest zero-order correla- 
tion with the dependent! measures . For these reasons, a regres- 
sion analysis of predic'tive bias by the MRT was imder taken via 
the Potthoff (1966) procedure. With a single independent varia-; 
bl§,, the Potthoff procedure is a computationally simple compar- 
ison of regression lines that provides a single F-ratio simul- 
taneously testing slope and intercept values. If a significant 
F results, separate tests of slopes and intercepts may then be 
conducted to determine if the bias in prediction is constant 
(intercepts diffSr) or changes "(^ith distance of the score from 

ERIC > ^ ' 15 
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• the mean (slopes differ). The results of the'Potthoff analysis 
are interpreted consistent with the analysis of residuals since 
if regression lines are simultaneous across groups, residuals 
must also be equal. 

RESULTS AND DISCUSSION 
The mean performance for each subgroup within the total 
^^ample of 322 is discussed in detail in Table 1. ^s a general 
rule, performance on the pretests and posttests rank ordered 
WF>WM>BF>BM» with the notable exception, pf the McCarthy Draw- 
A-Child subtest which ordered WF>BF>WM>BM. 

Summaries of the results of the four ANOVAs for the total 
battery are presented in Table 2. No significant differences in 

Insert Table 2 about here 

< 

the average errors of prediction for any of* the dependent varia- 
bles occurred by race or sex and no race by sex interactions 
/ proved significant at the .01 level of probability. Thus, the 
mean error of prediction for all race/sex groupings was essen- 
tially equivalent and approached zero in all cases. Had the 
£ <. .05 level been adopted, a single significant finding would 
have resulted, showing slight underprediction of performance for 
females on the MAT - Word Knowledge subtest. 

In the multiple regression equation for each of the dependent 
measures, the MRT maintained the largest Beta weight in all cases 
except one, giving it the greatest influence in the prediction 

ERIC 
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equation. As seen in Table 3, Rs were in the high .608 and low 
.708. The MRT also had the largest zfero-order correlation with 
each criterion variable when compared to th^other pretest scores. 
While this may be a function of the large number of abilities 
tapped by the MRT, it must be recalled that the MRT was admini^ 
stered three months closer to the criterion measures. The actual 
'regression equations are reported in Table 3. 



Insert Table 3 about h';re 
■ I 

When the MRT was examined alone as a predictor of first 
grade academic attainment, no bias in prediction occurred for 
any group. None of the F tests for differences in regression 
lines between groups even approached statistical significance 
(£ <^ .01) for any dependent variable. Fs ranged only from 0.15 
to 3.16. 

Some differences did occur when only the TOBE-Language and 
Lee-Clark were used as predictors however. Summaries of the 
results of the four ANOVAs for the two- test regression equations 
are presented in Table 4. No main effects occurred for race and 



Insert Table 4 about here 

there were no significant race by sex interactions for any of 
the dependent measures. With regard to sex, main effects 
occurred for the MAT-Word Discrimination and Word Knowledge 
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subtests but not for Reading or Arithmetic. Examination of 
the mean residuals for each group for Word Discrimination and 
Word Knowledge showed consistent underprediction of performance 
by females on each measure. The magnitude of the effect was 
small however, being on the order of .16 standard deviations on 
Word Discrimination and.. 13 on Word Knowledge. This translates 
into approximately 2-3 percentiles. This finding is consistent 
with a trend noted in the analysis of the larger battery. These 
two screening tests would tend to over identify females as 
potential candidates for remedial or special education classes. 
However, it is interesting to note that special education cla83es 
as a rule are more sexist than racist in makeup with boys out- 

I 

numbering girls 4:1. While the finding of sex bias in prediction 
is interesting and indicates a real need for follow-up study with 
a variety of other tests, the small magnitude of the effect, 
specificity of appearance (occurring only in two rather limited 
areas of achievement), and the relatively small percentage &f 
fe^iales in special education programs indicate few problems of 
practical significance in the use of these tests in preschool 
screening. 

The present study does not indicate the presence of bias in 
the prediction of first grade achievement across race and sex - 
when a large battery of tests is employed. Racial bias in pre- 
diction also did not occur for the single test nor the two'test 
battery. Sex effects with the two-test battery suggest r need 
for testing a diverse sample of skills to avoid bias in predic- 
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cion. The MRT tests a large number of abilities as does the 
larger battery; the two- test battery employed is more specific 
to reading and language, related skills. While testing a wide 
range of abilities may help insure against bias in prediction, 
the small magnitude of the effect would not support the use of 
mote than several tests under the scrutiny of any reasonable 
cost-benefit analysis. Use of the MRT as an integral part of a 
screening program seems to be one of the most reasonable alter- 
natives. However, much replication and further research will 
be needed before the effects of race and sex on our ability to 
predict early academic status if fully understood. The researcher 
and practitioner should be aware of the potential for bias existent 
in working with preschool populations.. Particular attention 
- should be awarded the sex variable. It has been long Ignored in 
studies of differential validity of cogni<:ive batteties, although 
it has received the attention of person.allty res.earchers , at 
least with regard to construct validity (e.g., Katzenmeyer & 
Stenner, 1977; Ozehosky & Clark, 1971). ' 

Test developers also need to become aware of the issrie of 
bias to the point of demonstrating differential construct and 
predictive validity as a part of the development of a test. Test 
authors and publishers need to demonstrate factorial invariance 
across all groups for whom the Instrument is designed, in order to 
make the instrument more readily lnterpretabl#. Comparisons of 
pzedictive validity across race and sex during the test develop- 

Iff ' 
i 
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ment phase are also desirable. At this stage, tests can be 
altered through item analysjls techniques to* eliminate racial 
And/or sexual bias in the prediction of performance. Studies 
similar to the p^resent investigation are needed with existing 
mefisuremen't instruments to "determine idiether alterations in 
interpretation are needed when applied to various populations. 
A variety of criteria also need to be employ e^^ including other 
group achievement measures, individual tests of academic 
attainment, and teacher-made tests. This is ^rue for the af^^ct 
ive as well as the cognitive domain (Reynolds, 1978a). 



Table i 

»^eans and SD's for each race/sex grouping on all pretests, posttests, age,. and SES. 



Group Identification 



Variable Name 


White Females * 


White Males ' 


Black Females 


Black 


Males 


V 


V 
A 


ou ^ 


^ X 


SO 


tr 
A 


SO 


J 


SO 


TOBE-Language 


111.21 


11.23 


106.67 


12.72 


88.97 


14.48 


90.58 


12.54 


TOBE-Math , 


' 106.84 


10.48 


105.10 


11.22 


68.19 


12.61 


^88.81 


16.99 


Preschool Inventory 


119.43 


6.88 


. 117.43 


11.28 


105.48 


11.65 


104.19 

• 


12.14 


Lee-Clark 


114.21 


8.83 


106.28 


18.40 

> 


93.22 


^.40 


92.48 


22.30 


Oraw-A-Design 


93.61 


15.31 


91.22 


14.73 


88.08 


13.84 


78.84 


13.08 


Draw-A-Chlld 


103.89 


18.28 


96.40 


15.84 


98.90 


17.82 


85.41 


16.26 


NRT-Total Test 


67.59 


14.16 


62.59 


17.82 


50.23 


14.89 


44.95 


14.20 


NAT-Word Knowledge 


69.61 


25.64 

• 


59.26 


32.10 


51.44 


26.11 


38.30 


24.94 


MAT-Uord Discrimination 


69.63 


24.30 


60.98 


27.30 


53.85 


26.18 


AA Atk 


9t Ae\ 


HAT-Reading 


66.11 


28.69 


56.71 


' 31. 32 


44.37 


32.47 


36.77 


25.68 


MAT-Arlthmetic 


71.91 


19.99 


67.12 


20.46 


53.41 


19.99 


49.14 


19.51 


SES ^ 


2.42 


1.14 


2.52 


1.15 


3.58 


1.04 


3.73 


0.98 


"^Afle (In months) 


82.56 


3.60 


82.59 


3.87 


82.58 


3.57 


82.53 


3.98 
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Table 2 



Analysis of Variance SufSamry Zables for Analysis 
of Residuals Based on All Independent Variables a^ 

Predictors 
I. MAT - Word Discrimination 



Source 




df 


MS 






Racc^ > 




1 


721. lA 


• 


1.58 


Sex 


• 


1 


462.62 


« 


1.02 


Race X Sex 




1 


138.13 




0.30 


Error 




318 


454.79 






II. MAT - 


Word Knowledge 










Source 




df 


MS 




P* 


Race 




1 


326.86 




0.61 


Sex ' 




1 


2629.52 




5.48 


Race X Sex 




1 


350.92 




0.73 


Error 




318 


48$. 08 






III. MAT - 


ReadinR 










Source 




df 


MS 






Race 




1 


74S.87 




1.49 


Sex 




1 


137.71 




0.27 


Race X Sex 




1 


5.17 




0.01 


Error 




318. 


• 504.46 






IV. MAT - 


Arithmetic 










Source 




df 


MS 






Race 




I 


77.30 




oAi , 


^ex 




1 


167.99 




> 0.68 


Race X Sex 




1 


287.80 




1.17 


Ezror 


* 


318 


246.36 







2 i 

* none significant at £ <. .01 



Table 3 

Multiple Regression Rquatibns for the Prediction of HAT Scores Employing 
All Independent Variables 



Dependent 
Variable 



MAT - 
Word Knowledge .67 
MAT - Word 

Discrimination .66 
MAT - 

Reading .71 
MAT - 

Arithmetic . 73 



« 

Prediction Equation 

.72 (MRT)* - .33 (TOBE^Language)* + .12 (DAC)* + .09 (Lee-Clark)* + 
.09 (Preschool Inventory) •♦• .0$ (TOBE-Math) + .03 (DAD) - 37.88. 
.63 (MKT)* + .29 (TOBE-Language)* + .15 (DAC)* + .15 (Lee-CXark)* + 
.05 (DAD) + .05 (TOBE-Math) + .002 (Preschool Inventory) - 33,. 67. 
.70 (MRT)* + .A9 (TOBE-Language)* + .^3 (DAC)* + .16 (Lee-Clark)* + 
.01 (Preschool Inventory) + .01 (DAD) + .002 (TOBE-Math) - AO. 78. 
.48 (Preschool Inventory)* .32 (MRT)* -f .21 (TOBE-Language)* + 
.IgStLee-Clark)* -f .07 (TOBE-Math) + .003 (DAC) + .001 (^AD) - 46.14. 



T 



♦variables contributing at a statiscially signifianct level (£ < .05) to the prediction 
of the dependent variable 
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. Table 4 

Analysis of Variance Sunnnary Tables for Analysis of 
Residuals Based on TOBE-Language and Lee-Clark as Predictors 



I. 


MAT - 


Word Discriminaticri . 


R - .58 




Source 




• 


df 


MS 


F 


Race 




• 


1 


105.11 


" * 

1.63 


Sex 






1 


427.39 


6.64* 


Race X 


Sex „ 




1 ' 


138;>Sip 


2.15 


Error 






318 


64.40 




II. 


MAT - 


Word Knowledge 




R - .58 




Source 




* 


df 


MS 


F 


Race 






1 


19.01 


0.30 


Sex 






1 


746 . 70 


11.90* 


Race X 


Sex 




.1 


220.33 


3.51 


Error 




• 


318 


62.71 




III. 


MAT - 


Reading 




R - .63 




Source 




9 


df 


MS 


F 


Race 


* 




1 


79.68 


.1.35 


Sex 




• 1 


1 


244.80 


. • 4.14 


Race x 


Sex 




1 


80.22 ■ 


1.36 


Error 






318 


59.09 




IV. 


MAT - 


Arithmetic 




R - .68 




• 

Source 






df 


MS « 


F 


Race 




• 


i 


1.51 


0.03 


Sex 






1 


75.14 


1.42 


Race X 


Sex 




1 


130.48 


2. .47 


Error 






318 


32.91 





* significant at £ <. .01 
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Notes 

^arry P. et al v. Riles et al.. 343 F. Supp. 1306 (D.C.N.D. 

Cel.. June 20. 1972). 

2 

litis procedure is conceptually similar to analysis of co- 
variance and tends to provide slightly inflated F- ratios (see 
Cochran & Cox, 1957, for a statistical explanation) thus 
giving a "conservative" test of the question of bias. That is, 
this procedure is more likely to overestimate the presence 
of bias than are mdre exact, but computationally complex, pro- 
cedures . 
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